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Abstract 

This paper presents a procedure for testing the hypothesis that the underlying distribution 
of the data is elliptical when using robust location and scatter estimators instead of the sample 
mean and covariance matrix. Under mild assumptions that include elliptical distributions with¬ 
out first moments, we derive the test statistic asymptotic behaviour under the null hypothesis 
and under special alternatives. Numerical experiments allow to compare the behaviour of the 
tests based on the sample mean and covariance matrix with that based on robust estimators, 
under various elliptical distributions and different alternatives. This comparison was done look¬ 
ing not only at the observed level and power but we rather use the size-corrected relative exact 
power which provides a tool to assess the test statistic skill to detect alternatives. We also 
provide a numerical comparison with other competing tests. 
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1 Introduction 


The family of elliptically symmetric (or elliptically contoured) distributions generalizes the family 
of multivariate normal distributions. One advantage of the elliptical distributions is that they define 
a much broader class of multivariate distributions than the multivariate normal distributions so 
that they can serve as the basis for the development of more robust analyses. In fact, in many situ¬ 
ations, normal-theory analyses can be modified slightly retaining their validity across all elliptical 
distributions. The fact that many statistical procedures (including principal component analysis) 
yield superior performance when data support elliptical symmetry, motivates the consideration of 
testing for elliptical symmetry, instead of testing for other forms of multivariate symmetry. 

Zhu and Neuhaus (2003) introduced conditional test procedures for testing elliptical symmetry 
of a multivariate distribution. The conditional tests are exactly valid if the center and the shape 
matrix are known and are asymptotically valid if they are estimated, when fourth moments exist. It 
is worth noting that the test proposed by Zhu and Neuhaus (2003) are based on the sample mean 
and the sample covariance matrix, when the center and/or the shape parameters are unknown. 
This entails that the test statistics are asymptotically valid only for elliptical distributions such 
that IE(||Xf) < oo. In a robust framework, one frequently assumes that the sample belongs to a 
neighbourhood of a given central elliptical distribution Pq. The distributions P to be considered 
in the neighbourhood include heavy tailed distributions. Furthermore, in order to ensure Fisher- 
consistency of the proposed estimators, it is generally assumed that the resulting distribution P is 
also elliptical. So, it is of interest to check if the assumption of elliptical symmetry is valid without 
making moment assumptions. For this reason, in this paper, we propose a testing procedure that 
can be helpful to decide if a given sample has a common elliptical distribution without requiring 
moment conditions when consistent estimators of the unknown parameters are available. 

The paper is organized as follows. In Section O we introduce our proposal, while asymptotic 
distribution results under the null and under contiguous alternatives are provided in Section [3l 
A bootstrap method to compute effectively an approximation of the proposed test is presented in 
Section [H The results of a Monte Carlo study in dimensions p = 2 and 5 are summarized in Section 
[6l while a procedure to compute the test statistic is described in Appendix A. Besides, in Section 
[5l we analyse the behaviour of the proposed test statistic and the classical one under different 
distributions and sample sizes, so as to check their ability to reject the null hypothesis against a 
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set of alternatives. Proofs are relegated to Appendix B and C. 


2 Test statistic 

For the sake of completeness, we briefly recall the notion of elliptical symmetry. One can define 
symmetry in terms of structural properties of the distribution function, of the density function or 
of the characteristic function. The distribution of a p-dimensional random vector Y is called spher¬ 
ically symmetric when, for every px p matrix A £ 0{p) (the orthogonal group), the distribution of 
AY is the same as that of X. A random vector X G has an elliptically symmetric or elliptically 
contoured distribution, with parameters /z G and a non-singular matrix S, if Z = —/x) 

is a spherically symmetric random vector. If this elliptical distribution has finite second moments, 
then /X is the mean vector and S is up to a scalar the covariance matrix. More generally, under 
no moment conditions, the parameters jj, and S are called the location and the scatter matrix 
parameters, respectively. The associated characteristic function of an elliptical vector has the form 
4>{t) = for t G for some scalar function : M —)• M. Then, if second moment 

exists Var(X) = — 2?/)'(0)S, where for identifiability of S it is usually required that "0^(0) = ~ 1/2- 
We will write that X ~ Tp(/x, S, '0). If in the expression of (/>(t), we get the char¬ 

acteristic function of a normal distribution, so elliptical distributions are generalizations of normal 
distributions. For an overview on these distributions we refer to Fang and Anderson (1990) and 
Fang et al. (1990). 

Among the tests for spherical and elliptical distributions that have been introduced, we can 
mention Beran (1979), Tyler (1982), Baringhaus (1991), Fang et al. (1993), Koltchinskii and Li 
(1998), Koltchinskii and Sakhanenko (2000), Schott (2002), Zhu and Neuhaus (2003), Buffer and 
Park (2007) and more recently, Batsidis and Zografos (2013) and Batsidis et al. (2014). 

The goal of this section is to suggest a modification of the conditional test for ellipsoidal sym¬ 
metric multivariate distributions proposed in Zhu and Neuhaus (2003), which allows its application 
to data coming from heavy tailed elliptical distributions. This is particularly appealing in a robust 
framework, since many resistant statistical procedures assume that the underlying distribution is 
elliptical to get Fisher-consistent estimators. 

Let X be a p-dimensional random vector with distribution P. Given independent and identically 
distributed (i.i.d.) observations Xi,..., X„ such that X.j ~ X, denote by Pn the empirical measure 
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based on the sample points. Moreover, let Pnf = Pnif) stand for (1/n) Yl]=i fO^j) function 

f :RP 

From now on, denote £p the class of all elliptical contoured distributions. The hypothesis to be 
tested is Hq : P G £p, that is, Hq : X. ^ £p{f^, with parameters /x and S. For each hxed b 

and A define the functional 


TbMP) 


f f Ep^sin|ta^A (X — b 

J Sr) 'J 


2 

w{t)dtdv{a.) 


( 1 ) 


where rc : M —>• M is a weight function , = {a G : ||a|| = 1}, v the uniform distribution 

on Sp and Ep indicates that the expectation is taken with respect to the probability measure P. 
Note that if X ~ P = £p{n, then Z = — fi) is spherically distributed. Hence, the 

imaginary part of its characteristic function vanishes, that is, Ep [sin(ta^Z)] = 0 for any t G M and 
a G 5p which implies that = 0. 

When jji and X are known, the empirical version of Tfj,^s{P) will also be close to zero. This 
suggests to reject the null hypothesis Hq for large values of the test statistic TnitJ-, X), where 


r„(b,A) = 



VnPn 


sin 


ta"^A 


- 1/2 


(X — b) I w{t)dtdv{a) 


( 2 ) 


The statistic S) was considered by Zhu and Neuhaus (2003) when tc : M —)■ M has a compact 
support I and is a weighted version of that studied by Ghosh and Ruymgaart (1992). 

Usually the location and scatter matrix parameters are unknown. To overcome this problem, 
one may replace in TnifJ', X), /x and X by consistent estimators. Zhu and Neuhaus (2003) suggested 
to use the classical (cl) sample estimators, leading to the test statistic T^^cl = T)i(/i, X), where 
p, = X = X^”=iXj/n and X = S = X]”=i(Xj — X)(Xj — X)'^/n. As mentioned in Anderson et 
ai. (1986), for a sub-class of elliptical distributions, the maximum likelihood estimator of is X, 
while that of X is a constant multiple (depending on the family) of the sample covariance matrix, 
that is, the estimators have the same form as in the normal case, which justihes the above choice. 
Anderson et ai. (1986) studied the general situation of elliptical random matrices X = (Xi,..., X^) 
which includes the setting of independent columns we are considering. In the particular case of 
independent random vectors Xj, the class of elliptical distributions for which the sample mean and 
covariance matrix (except for a constant) are still the maximum likelihood estimators includes the 
situation when the density of Xi equals det(X)“^/^ ff((x — /i)^X“^(x — /i)) for some g :R ^ M"*", 
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where y^^‘^g{y) has a finite positive maximum. A condition ensuring the existence of hnite positive 
maximum is the continuity of g and that E ||X|p < oo (see Lemma 2 in Anderson et ah, 1986). On 
the other hand, if the underlying distribution is heavy tailed the values of these estimators may be 
distorted, rendering meaningless the test results. A solution to this problem is well known in robust 
statistics: fi and SI have to be estimated in a robust manner, to provide consistent estimators even 
if moments do not exist as in the case of a multivariate Cauchy distribution. 

The proposal in this paper, consists in plugging into the conditional test statistic T„(/.i,Sl) 
robust consistent estimators and of the location and scatter matrix S, respectively, to 

test Hq : P & £p. This leads to the following conditional robust based statistic 


^ n,m,V 


= r„. 


(m„,V„)= f I ^^/nPnSm (X - m„) | w{t)dtdv{aL), 


where Pn, a, w and v are defined as in ([2|). Denote V (P) and m(P) the functionals related to and 
m„, respectively, when X ~ P. Usually, under Hq, m(P) = and V(P) is, up to a multiplicative 
constant, equal to X. Then, the functional related to T„(m„, V„) is just Tjn(p),v(P)(-P) dehned in 
m which justihes the considered procedure. In Appendix 16.41 we describe a numerical procedure 
to compute this test statistic. 


Remark [2ll. The results on characteristic functions given in Ushakov (1999) give some insight 
with respect to the choice of the weight function w. Indeed, as dehned in Ushakov (1999) a 
characteristic function is said to be analytic if there exists a function g : C ^ C which is analytic 
in {| 2 :| < P} for some P > 0 and such that ^(t) = g{t) for any t G [—P, i?]. Theorem 1.7.7 in 
Ushakov (1999) states that if a characteristic function (p coincides with an analytic characteristic 
function <1> in some real neighbourhood of the origin, then they coincide for all real, that is, p = ^. 

Given a random vector X G denote Zg, the random variable = a^X“^/^(X — fi) and pa 
its characteristic function. Assume that for any a G Sp, pa is an analytic characteristic function. 
Theorem 1.7.7 in Ushakov (1999) entails that if, for any a G Sp, and for some 5 > 0 we have that 
Pa{t) = Re(y?a)(i) for t G (—(j, (j) then, the random variable Za has a symmetric distribution for 
all a. So, in the situation where the underlying distribution is such that all the projections have 
analytic characteristic functions, which includes the multivariate normal as well as the uniform 
distribution on the ball or in sphere, if the functional related to the test statistic is zero for some 
weight function w, with support around 0, then it will be 0 for any weight function. Thus, for 
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probability measures such that for all a E iSp, the distribution of has an analytic characteristic 
function the choice of w is not crucial as far as its support contains a neighbourhood of 0. 


3 Asymptotic behaviour of the test statistic 


In order to derive the limit behaviour of the proposed test statistic, we will assume that w{t) 
has bounded support contained in some finite interval X and we will introduce the empirical pro¬ 
cess given by Wn = |bVn(t,a) = y^P^sin (X — m„) , (t,a) E X x 5p|. Theorem [3ll 

states the asymptotic distribution of the process {Wnit^a)} under the null hypothesis, while The¬ 
orem [3l2 concerns the behaviour under local alternatives. Note that in our statement we do not 
require neither finite moment conditions to the random vector X, nor a rate of convergence of the 
shape matrix estimator. In this sense, our result provides an improvement over the proposal given 
in Zhu and Neuhaus (2003) who required finite fourth moment. As shown in Section [UJ the lack 
of moments may distort the results of the classical test based on while when using robust 

estimators the test is still reliable. On the other hand, when second moments exist, if we take 
m„ and V„ as the sample mean and covariance matrix, respectively. Theorem [3jl provides the 
asymptotic distribution of Wn under slightly more general conditions than those given in Theorem 
2.1 of Zhu and Neuhaus (2003). 


3.1 Behaviour under the null hypothesis 

Theorem [3l 1. Let I he a bounded interval. Assume that X ~ P = Tp(/i, X, ^), i.e., that Hq holds 
and that yjlog Hudu < oo, where is the smallest value P > 1 such that P(||S“^/^(X — fJ.)\\ > 
H) < that is Hs = max(l, P^^(l—<5^)) withFs the distribution function of S = ||S“^'^^(X—p)||. 
Moreover, assume that P(X = /x) = 0 and that and m„ are consistent estimators of X and /i, 
respectively, such that m„ admits, for some function a^n : M —>■ M, a Bahadur expansion as follows 


y^{mn - /x) = — ^ (Xj - p) Om [W'Zi (Xj - /x) 
^ 1=1 


+ 


(3) 


where Ep||X-^/2 (x _ fj,) f a^(||5]”i/2 (x _ ^) ||) < oo. 

Then, the process Wn = | Wn(t, a) = y/nPn sin ta^V„ (X — m„) , (t, a) E X x 5p| converges 

in distribution to a centered continuous Gaussian process W = {>V(t,a) , (t,a) E X x 5p} with 
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covariance kernel given hy Ep[A:(i,a, 
A:(t, a, x) = sin (x —/x) — 


X)fc(s,b, X)], for (i, a) £ I x Sp and (s,b) £ I x Sp, 
(x - n) am (x - fi) ||^. 


where 


Remark [311. Note that the classical location estimator, that is, the sample mean corresponds 
to am{t) = 1 and this is the situation considered in Theorem 2.1 of Zhu and Neuhaus (2003) 
which requires the existence of fourth moments. On the other hand, as shown in Hampel et 
al. (1986), if lUn is an estimator related to a functional m(P) that is affine equivariant there 
exists a real function am ■ 1^ such that its influence function equals IF(xo,m, Pq) = 

(xq — /x)a!m(||X~^/^(xo — /.t)||). In most cases, the influence function is bounded so the assump¬ 
tion Ep||X-i/2 (X - /i) ||2 a^(||X-i/2 (X - fx) II) < oo is satisfied and no moment conditions are 
required. Besides, as it is well known, under mild conditions, the influence function allows to 
obtain a Bahadur expansion for the location estimator (see Fernholz, 1983). In particular, for 
the S—estimator (see Lopuhaa, 1989), we have that am{t) = P~^Us{t) where Us{t) = il;s{t)/t, 
ipsit) = p'sit)- Let Go be the spherical distribution related to Pq, that is, Gq is the distribution of 
X~^/^(xo — /x). Then, the constant f3 is given by 


/3 


Ego 


P. 




(4) 


Usually, the influence function is computed at the central Gaussian distribution, so that Gq = 
A^(0,I). A common choice for the p—function dehning the S'—estimator is the Tukey function 
dehned as ps{y) = {c^/6)uim{l - [l - (y/c)^] ^ , 1}. Hence, 'ipsiv) = V [^ - l[-c,c]{y), 

V’s(y) = [1 - 6 (yVc^) + 5 (//c'^)] II[-c,c](y) and Us{y) = [l - (yVc^)] ^ I[-c,c](y) (see Lopuhaa, 
1989). 


Remark [31 2. Let us show that the assumption J'q -y/log du ^ oo where E(||X ^ (X! — /^)ll P 

P&) < is fulfilled for some distributions where fourth moments may not exist. For the sake of 
simplicity, we will assume fi = 0 and S = Ip, since otherwise, we may consider Z = X“^/^(X — fx). 

It is clear that if Ep||Z|p < oo, then < (Ep||Z|p)^''^^/(5 and Jq ^/\og^T^du < oo. More 
generally, if EpUZH*^ < oo, for some > 0, then Hg < (Ep||Z||^)^^'^so y/logTJudu < oo. 

As an example of elliptical distributions satisfying the condition \/log du < oo, let us 
consider the multivariate t—distribution with k degrees of freedom, i.e., Z ~ 7),,fc(0, 1 ). As is well 
known Z has no finite fourth moment when A: < 4. Besides, Z has the same distribution as 
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where Vk = ky~^ ~ xt W ~ Np{0,l), where xt stands for the chi-square distribution with k 
degrees of freedom. Then, if the two expectations on the right hand side of ([5]) below exist, by the 
independence between W and y we have that 

( II wih \ 

= k^ E|| W|r E (Ufc™) , (5) 

’’^k J 

where m = z//2. Note that E||W||'^ < oo for any u > 0. On the other hand, using that Vk ~ xl; 
have that, for any 0 < i' < k, E(n^ < oo, which entails that for the multivariate t—distribution 
y/log Hu du < oo. This result shows that our assumption is a very mild one since it includes for 
instance, the multivariate Cauchy distribution. 

The following Corollary gives the distribution of the test statistic under the null hypothesis. 

Corollary [3]. 1. Under the assumptions of Theorem Of if w{t) is a weight function with bounded 
supportZ, then Tn{mn,'Vn) fg j-j^W‘^{t,aL)w{f) dt dv{a) where the process yV(t, a) is defined 
in Theorem [3l 1. 

3.2 Behaviour under the alternative 

Regarding the consistency of the test, it is well known that if X ~ S, ip), then for any a £ Sp, 
Za = a^S“^/^(X —/r) has a symmetric distribution, but the converse is not true. A typical example 
being a random variable Y with distribution uniform on the set {x : — 1 < Xj < 1 for all j} which 
satishes that /Y(y) = /Y(~y) ensuring that all projections a^Y are symmetric. In this situation, 
for observations having a symmetric distribution but not a spherical one, the test type-statistics 
considered in Zhu and Neuhaus (2003), Ghosh and Ruymgaart (1992) or in this paper will not reject 
the null hypothesis. This is a feature of any projection-pursuit procedure based on the property 
that any projection of a spherical distributed random vector is symmetric. 

On the other hand, as mentioned in Zhu and Neuhaus (2003), if Ep sin(ta^5]“^/^(X — /x)) ^ 0, 
for some t £ X and a £ Sp, then > 0. Therefore, using that r„(/x,5])/n Tp^^siP) 

together with the consistency of m„ and V„, we obtain that the test statistic T„(m„,V„) will 
converge to oo and the test is consistent against global alternatives. 

To derive the distribution of the test statistic under a set of alternatives, denote as sin^'^)(t) the 




j—th derivative of the sinus function at t. Recall that V(P) and m(P) stand for the functionals 
related to and vn.n, respectively, when X ~ P. We will assume that m(P) is affine equivariant. 
To strength the dependence on the sample, we will denote as Pri,x the empirical distribution of 
the sample Xi,... ,X„ and and the estimators based on that sample, that is, V„^x = 

V(Pn,x) and m.„^x = ni(P„^x)- Moreover, assume that the i.i.d. observation X* = Xj„ are 


such that Xj = Xj„ = Zj + Yjn “ for some a > 0 where Zj are i.i.d. such that Zj 


Pn = 


Tp(/r, S,^). Due to the equivariance of the location estimator and without loss of generality, we 
may assume that E(Y) = 0. Effectively, if we define X* = X* — E(Y)/n“, using that m ^ = 


Hin.x ~ E(Y)/n", we have that P„ sin ^^^(X — 

to obtain the asymptotic behaviour of the test statistic under these alternatives we may assume 
that E(Y) = 0. 


= Pn sin 


taTV-^/'(X-m„,x) 


, so 


Theorem [3l2. Let w{t) be a weight function with support X and Z ~ Pq = Sp{p, X, ijj). Assume 
that the following assumptions hold 


a) Zj are i.i.d. such that Zj ~ Z and define Xj = Xj„ = Zj+Yj n ", where a > 0 and E(Y) = 0. 

b) P(Z =/r) = P(X =/r) = 0, 

c) y/log Hudu < oo, where Hs is the smallest value > 1 such that P(||S“^/^(Z — /r)|| > 
H) < that is Hs = max(l,P^^(l — h^)) where Fs is the distribution function of S = 

||X-V2(Z-,,)||. 

d) V„^x and m„^x are such that V„^x X and m„^x — R 0. 

e) The functional m(P) is such that the Bahadur expansion |3) holds at Pq, that is, for some 

function : M —)■ M, m„,^z satisfies (0) where Ep^||X“^/^ (Z — fx) |p a^(||X”^/^ (Z — p,) ||) < 

oo. Furthermore, 

v^(m„,x - fJ') = v^(m„,z - r) + op(l) . (6) 


f) there is a positive integer m such that EHYlp™ < oo and for some 5m > 0 


sup \Bm{t,a)\ 

xSp 


sup 

{t,a)eX^-m xSp 


E|(^ta'^X"^/2Y)™sin(™)(ta'^X"^/2(2 _ /x))| / 0, 


where X^ stands for a 5—neighborhood ofX = [—n, n]. 
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Let I be the smallest positive m satisfying f). IfY^ and Z are not independent assume in addition 
that for any 1 < s < £, fg \/logHuidu < oo, where q = £j{£ — s). Then, when a = l/(2£), 


T„(m„,V„)-^ j j (w{t,a) + ^Bi{t,a)^ w{t)dtdv{a), 
where the process VV’(t, a) is defined in Theoremf^l. 


4 Bootstrap method 


As mentioned in Zhu and Neuhaus (2003), the asymptotic behaviour of the test statistic does not 
allow to compute easily p—values, so a bootstrap method is needed. Zhu and Neuhaus (2003) 
describe a bootstrap procedure when the center /x and the shape parameter S are known and when 
pL is estimated using the mean of the observations. When the center and the shape matrix are 
unknown, a slight modification to the method considered in Zhu and Neuhaus (2003) is needed to 
adapt to the resistant location estimators, since the estimated shape parameter does not influence 
the distribution of the test statistic. 

One possibility is to adapt the bootstrap statistic defined in Zhu and Nehaus (2003) to the 
present setting. For that purpose, assume, as in Section [3l that the affine equivariant location 
estimator admits a Bahadur expansion given by (I3|). Let d = a^Vn ' . Using that 


Pn sin 


td'^ (X - m„) 


= cos 


tdL^ (/X - m„) 


Pn sin 


— Sin 


td'^ (m„ - /x) 


Pr, COS 


td^ (X - /x) 
td^ (X - /x) 


the first order von Mises expansion ([3|) and the fact that /x, we have that 


'/nPn sin 


td^ (X 



^/nPnSm (ta^z) - y^sin {ta^P„ [za^ (||z||)]} [ cos (ta^z) j + op(l). 


where z = S (X — /x) ~ u||X (X — /x) || and u ~ hl{Sp). 

However, the implementation of this bootstrapping method when using a location and scatter 
matrix robust estimators implies the computation of am{t), which equals 1 for the sample mean, but 
may be more complex when using robust estimators. For instance, when using 5—estimators the 
function am{t) involves the calculation of the constant /3 defined in ([1|). In robustness, [3 is usually 
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computed under the standard normal distribntion. However, in our situation, the constant /3 must 
be computed under the spherical distribution related to the underlying elliptical distribution of the 
sample. This is a drawback of this bootstrap method since the correct distribution is unknown. To 
avoid this vicious circle, we consider a bootstrap statistic that can be computed as follows: 


Step 1 Generate i.i.d. random vector u* ~ U{Sp) and define = (ui,...,u„). Let 


X* = Ui||V„^/"(Xi-m„ 
tion of X* 


be the bootstrap observations and P* the empirical distribu- 


Step 2 Define lT*(t, a) = y/nP*s\u ta^V* (X* — m*) , where m* and V* are the location and 
scatter matrix robust estimators of the bootstrapped sample X* and calculate the statistic 
^n,m*,v*(Un) = fW*(t,af w(t)dtdv(a). 


Step 3 Repeat Steps 1 and 2 Nboot times to get Nboot values of T*^^, Y*(Un ^), 1 < j < Nboot. 

Step 4 Estimate the p—value as p = k/{Nboot + 1) where k is the number of 
greater or equal than 


Through this algorithm we obtain a sample of bootstrap replicates , 1 < j < Nboot whose 
distribution approximates the distribution of Tn^m,w under the null hypothesis, as desired. 

The proof of the asymptotic distribution of the boostrap procednre is an interesting topic which 
we leave for future research. 


5 Finite sample distribution of the test statistic when p = 2 

We generate independent observations Zi,..., Z„, Z, ~ P according to different elliptical models 
under the null hypothesis. Let S) be the multivariate p—dimensional t—distribution with 

k degrees of freedom, which includes the multivariate Cauchy distribution when fc = 1, and denote 
U{Sp) and U{Bp) the uniform distributions over the unit circle and the unit ball, respectively. 
Consider the null hypotheses : P = A^p(0, 1 ), : P = 0.9A^p(0,1) + 0.17^_i(0, 1 ), : 

P = 0.9iVp(0,I) + 0.17^, 3 ( 0 , 1 ), : P = 7 ;, 3 ( 0 , 1 ), 77^'') : P = U{Sp), : P = U{Bp) and 

77^") : P = rp,i(0,I). 

For each null hypothesis 77g , we consider different alternative hypothesis 77| ^, related to 

(?) 

the original distribution P in the null hypothesis. Under 77j the observations are generated as 
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Xj = Zj + AYj with Zi ^ P independent of Yj and Y = (Yi, Y 2 )^ where ~ Xi independent 
among each other and A = 0.5, 1 and 1.5. We also studied the behaviour of the statistics under 
two fixed alternatives and Under the data have the distribution of a random 

vector with two independent components, £’(1) and A(0,1), where £’(A) denotes the exponential 
distribution of parameter A, that is, with mean value 1/A, while the alternative corresponds 

to the distribution of a random vector with two independent components, £{Y} and £1(1/2), that 
is, with expectation 1 and 2, respectively. The first three alternatives were studied in Zhu and 
Neuhaus (2003), while Koltchinskii and Li (1998) studied the capability of their proposal to detect 
Hf^ and Hf\ 

In all cases, we perform N = 1000 replications for samples of size n = 20, 50, 100 and 200. For 
each sample, we compute the test statistics with the mean and sample covariance matrix, denoted 
by Tn^chi with the Donoho-Stahel estimators of location and scatter, denoted by Tn,m and with 
the 5—estimators of location and scatter, denoted by Tn,s- Both robust estimators are calibrated 
to attain 50% breakdown point. We choose as weight function w{t) = ;,](£) with 6 = 2. 

In Figures [H to [71 the density estimates of test statistics Tn^ch, T/^os and Tn,s are plotted under 
the null hypotheses to and under their corresponding alternatives. The density estimates 
were evaluated using the normal kernel. 

As expected, in most cases the classical test statistics is more sensitive to the lack of elliptical 
symmetry of the alternative distributions than the robust test statistics. However, for n = 20 in 
the considered situations all statistics, the classical and the robust ones, fail to distinguish the 
symmetric distribution under the null hypothesis from those considered in the alternatives. Indeed, 
for this sample size all the density estimates are almost overlapping. The tests detect some of the 
selected alternatives for n = 50. For n = 100, in all cases the ability of the test statistics to make 
out the nature of the underlying distribution increases and this fact becomes more clear for n = 200. 
For n = 100 and 200, the densities corresponding to the non-elliptical distributions generated under 
and are shifted to the right from those of the test statistic under the null hypothesis. 

This effect is less visible for n = 50. Hence, one could expect that the tests statistics will work 
well under these circumstances. On the opposite, except for Figure [6l the densities of all the test 
statistics under the null hypothesis and under the alternative ^ are almost overlapping. As 
expected, this performance is even worst for the classical test under Hq and where the 

distribution of the test statistic does not allow to distinguish between the null and the alternative 
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hypotheses even for n = 200 (see Figure [7|). Hence, one can not expect a good performance of the 

(i) 

classical tests in this case. A similar conclusion can be held for ( for n = 100, while for n = 200 
the behaviour of the test statistic depends on the distribution of Zj. 

This numerical approach suggests that, for p = 2, small sample sizes and values of A smaller 
than 0.5 when considering alternatives of the form Zj + AYj should not be considered in the Monte 
Carlo study presented in Section [6l 

6 Monte Carlo study 

In this section, we report the results of a simulation study conducted to analyse the performance 
of the test statistic obtained using robust location and scatter estimators with respect to that 
based on the sample mean and covariance matrix. The weight function considered in this Monte 
Carlo study equals w{t) = I[_6,6](t) where b = 2. Based on the results reported in Section [5] 
regarding the finite-sample distribution of the test statistics, we carried out NR = 500 replications 
for sample sizes equal to n = 200. To perform the bootstrap method described in Section 01 we 
used Nboot = 1000 bootstrap samples. The nominal level was set equal to a = 0.05. Besides, we 
also compare our test procedure with other known methods for testing elliptical symmetry, when 
p = 5. In what follows, TiHoiTn) and 7r//j(T„) stand for the observed probabilities of rejection 
of the test based on the statistic T„ under the null hypothesis and under the alternative Hi, 
respectively. As in Batsidis et al. (2014), we also examine if the empirical size is significantly 
different from the nominal level a = 0.05. To be more precise, let vr be such that vr. 

Then, using the central limit theorem, the hypothesis : vr = a is rejected at level 7 versus 
: vr / a if vr//g(r„) ^ [ai(a), 02(0)] where aj{a) = a + {a{l - j = 1, 2. 

If : vr = a = 0.05 is not rejected, the testing procedure based on Tn is considered accurate. 
Note that if -KHoiTn) < ai{a) the testing procedure is conservative, while if vrj|/g(T„) > 02(0) the 
test is liberal. In all Tables reporting the observed frequencies of rejection, we indicate with * those 
cases in which the observed empirical frequencies of rejection are different from the nominal level 
with a significance level 7 = 0.01. 

From now on, let S) denote the multivariate p—dimensional t—distribution with k de¬ 

grees of freedom, which includes the multivariate Cauchy distribution when k = 1, and denote 
hl[Sp) and hl{Bp) the uniform distributions over the unit circle and the unit ball, respectively. 
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Denote also as Xu the chi-square distribution with v degrees of freedom. 


6.1 Simulation study in dimension p = 2 

We generate independent observations Zi,...,Z„, Zi ^ P according to different elliptical mod¬ 
els under the null hypothesis. Consider the null hypotheses : P = A^p(0,I), : P = 

0.9Wp(0,I) +0.17;,i(0,I), : P = 0.9iVp(0,1) + 0.17;,3(0,1), : P = : 

P = U{Sp), : P = U{Bp) and : P = Tp,i{0, !)• 

(?) (?) 

For each null hypothesis Hq \ we consider different alternative hypothesis related to 

the original distribution P in the null hypothesis. Under the observations are generated as 

Xj = Zi + AYj with Zj ~ P independent of Yj and Y = ( 11 , 12 )^ where ~ Xi independent 
among each other and A = 0.5, 1 and 1.5. We also studied the behaviour of the statistics under 
two fixed alternatives and Under the data have the distribution of a random 

vector with two independent components, £’(1) and Y(0,1), where £{\) denotes the exponential 
distribution of parameter A, that is, with mean value 1/A, while the alternative corresponds 

to the distribution of a random vector with two independent components, £{P} and £1(1/2), that 
is, with expectations 1 and 2, respectively. The first three alternatives were studied in Zhu and 
Neuhaus (2003), while Koltchinskii and Li (1998) studied the capability of their proposal to detect 
and H f\ 

For each sample, we compute the p—values of the test statistics obtained using the mean and 
sample covariance matrix, denoted by Tn^ch-, the Donoho-Stahel estimators of location and scatter, 
denoted by Tn,m and the 5—estimators of location and scatter, denoted by Tn^- Both robust 
estimators are calibrated to attain 50% breakdown point. The corresponding frequencies of rejection 
are reported in Tables [U and [2l where A = 0 corresponds to the observations generated according 
to the null hypothesis. 

Taking as reference the first row of Table [H as expected, we observe some loss of power of 

( 2 ) 

the classical test based on T^^cl under the alternatives of the distributions considered in Hq 
and ^ where the data follow heavier tailed distributions. On the other hand, the opposite 
is observed when considering U{Sp) and U{Bp). The extreme situation is found under Hq and 
its alternatives, since the classical test completely looses its power. Indeed, in this case this test 
is unable to distinguish between data coming from the elliptical distribution 72,i(0,I) from data 
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generated under its alternatives ^ for A = 0.5,1 and 1.5. Besides, Table [2] shows that for both 

families of robust estimators of location and scatter matrix similar results are obtained, either in 

level or power. Indeed, with both robust estimators, the proposed tests lead to comparable results 

to those obtained with the classical test for ' for j = 1,3,5 and 6, even when for j = 1 and 

3 there is some loss of power under However, the robust tests outperform the behaviour 

observed with under the alternatives of and getting larger frequencies of rejection. 

Finally, from Table [2] we conclude that using Tn^os and Tn^s the decision rule has a good performance 

( 7 ) 

and is informative even under Hq ' and its alternative hypotheses. 

We have also considered two other alternatives also studied in Batsidis et aJ. (2014) which 
gave power 1 as and for that reason the results are omitted in the Tables. One of 

the alternatives, denoted as , is obtained generating random vectors with two independent 
components with a common beta distribution Be(5,l). The other one, corresponds to the 

distribution of a random vector with distribution 0.5 Ap(0,I) + 0.5 Np{fj,,'S) with fj. = (1,2) and 


S = 



-4 

5 


Ti.CL 





^l.A 




A 

0 

0.5 

1 

1.5 



-^0 

0.060 

0.478 

1.000 

1.000 

1.000 

1.000 

^(2) 

-^0 

0.086 

0.220 

0.748 

0.888 

1.000 

1.000 

7:^(3) 

-^0 

0.048 

0.412 

0.994 

0.998 

1.000 

1.000 

7 (r( 4 ) 

-^0 

0.044 

0.110 

0.794 

0.988 

1.000 

1.000 

^( 5 ) 

-^0 

0.057 

0.892 

1.000 

1.000 

1.000 

1.000 

77^(6) 

-“0 

0.054 

1.000 

1.000 

1.000 

1.000 

1.000 

77 ^( 7 ) 

-^0 

0.056 

0.058 

0.062 

0.076 

1.000 

1.000 


Table 1 : Frequency of rejection for the bootstrap test Tn^ch for n = 200 and dimension p = 2. * indicates that the 
frequency of rejection is significantly different from the nominal level. 

As noted before, the exact sizes of the test statistics, i.e., HoiTn^ch)■, '^Ho{Tn,Ds) and -KHoiTrifi) 
fluctuate around the fixed level ol = 0.05. To help in the visual comparison of the power perfor¬ 
mance of the three test statistics, as in Batsidis et ai. (2014), Table [3] reports the size-corrected 
relative exact powers PHxK^n,m-,T'n,ci) and (?n,S) ^u.cl)- For two test statistics, Tn'^ and Tn\ 
PHi{Tn\Tn"^) was defined in Morales et al. (2004) as 

PH, , Tf)) = ( - l) X 100 , (7) 
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-^ l.A 


Hf' 

nr 

A 

0 
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1 

1.5 



0 

0.5 

1 

1.5 



-^0 

0.048 

0.256 

0.990 

1.000 

1.000 

1.000 

0.060 

0.270 

0.988 

1.000 

1.000 

1.000 

^(2) 

-^0 

0.058 

0.283 

0.984 

1.000 

1.000 

1.000 

0.060 

0.309 

0.980 

1.000 

1.000 

1.000 

-^0 

0.042 

0.244 

0.984 

1.000 

1.000 

1.000 

0.050 

0.264 

0.986 

1.000 

1.000 

1.000 

^(4) 

-^0 

0.056 

0.206 

0.862 

0.998 

1.000 

1.000 

0.056 

0.212 

0.876 

0.998 

1.000 

1.000 

^(5) 

-^0 

0.062 

0.608 

1.000 

1.000 

1.000 

1.000 

0.068 

0.552 

1.000 

1.000 

1.000 

1.000 

^(6) 

-^0 

0.048 

1.000 

1.000 

1.000 

1.000 

1.000 

0.050 

1.000 

1.000 

1.000 

1.000 

1.000 

-^0 

0.048 

0.066 

0.464 

0.832 

1.000 

1.000 

0.046 

0.068 

0.440 

0.840 

1.000 

1.000 


Table 2: Frequency of rejection for the bootstrap test r„,DS and Tn,s for n = 200 and dimension p = 2 . -k indicates 
that the frequency of rejection is significantly different from the nominal level. 


with DniiTn) = 7rjyj(T„) — iTHoiTn). This measure allows to clarify the fluctuations in the powers 
which are more difficult to observe in Tables [T] and [2j 




PHi (7 n , DS , Ti . Ol ) 




PHl{T 

T.,S ; ^ TIjCl ) 





rr ( j ) 

-^l.A 


Hf> 

m 


fjO) 


Ht"’ 

Hf’ 

A 

0.5 

1 

1.5 



0.5 

1 

1.5 



-^0 

-50.239 

0.213 

1.277 

1.277 

1.277 

-49.761 

-1.277 

0.000 

0.000 

0.000 

tt(2) 

67.910 

39.879 

17.456 

3.064 

3.064 

85.821 

38.973 

17.207 

2.845 

2.845 

^(3) 

-^0 

-44.506 

-0.423 

0.842 

0.630 

0.630 

-41.209 

-1.057 

0.000 

-0.210 

-0.210 

^C4) 

-^0 

127.273 

7.467 

-0.212 

-1.255 

-1.255 

136.364 

9.333 

-0.212 

-1.255 

-1.255 

jj( 5 ) 

-^0 

-34.611 

-0.530 

-0.530 

-0.530 

-0.530 

-42.036 

-1.167 

-1.167 

-1.167 

-1.167 

-^0 

0.634 

0.634 

0.634 

0.634 

0.634 

0.423 

0.423 

0.423 

0.423 

0.423 

^(7) 

-^0 

800.000 

6833.333 

3820.000 

0.848 

0.848 

1000.000 

6466.667 

3870.000 

1.059 

1.059 


Table 3: Size corrected relative exact power for the robust bootstrap tests r„,DS and T^.s with respect to the 
classical one r„,cL for n = 200 and dimension p = 2 . 


Table [3] shows that in most cases, larger values of (T„, T„^cl) are obtained with the Donoho- 
Stahel estimators over the S'—estimators leading to the conclusion that the test based on the 
Donoho-Stahel estimators is a preferable choice. As noted before, the robust tests outperform the 
classical one specially for alternatives close to the null hypothesis under and . 

This performance was expected for the heavy tailed distributions and Hq , but it is also 
present under moderate tails as those of the 7^^3(0,I) since fourth moments do not exist. On the 
other hand, as expected, the classical test has a superior behaviour under a Gaussian distribution, 
when A = 0.5. The better performance of T„^cl for A = 0.5, is also observed under Hq and the 
uniform distribution over the unit circle. With respect to the detection of the alternatives 
and , all procedures are almost equivalent. Based on the simulated results obtained for the 
considered distributions, we recommend the test statistic based on the Donoho-Stahel estimators. 
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For this reason, in dimension p = 5 we only compare the test statistics T„^os and T„,cl- 


6.2 Simulation study in dimension p = 5 


In order to compare the performance of the two test statistics T„_cl and Tn^os, under the null 
hypothesis, we generate n independent observations Zi,..., Z„, Zj ^ P,7ii £ following different 
elliptical distributions as follows : P = Np{0,l), : P is the Pearson type II distribution 

generated as \/FU where U ~ hl{Sp) and V ~ Be{p/2,m), with m = 3/2, : P = 7^^5(0,I) 

and : P = 7^,i(0, 1 ). As in Section 16.11 we consider observations Xj, i = 1,..., n generated 
under the alternative hypotheses with A = 0.25,0.5,0.75,1 and 1.5. Besides, we studied 

the performance under four fixed alternatives for j = 1 to 4 defined as follows. Under 

the data have the distribution of a random vector with p independent components, the first 
p — 1 having distribution EiP) and the last one A(0,1). This distribution corresponds to in 

dimension p = 2. The second fixed alternative corresponds to the distribution of a random 

vector X with p independent components each of them with distribution T(l). Under H* , Xj ~ X 
where X is a random vector with p independent components with common distribution Be{5, 1). 
Finally, corresponds to the situation in which X has p independent components, the first 

p — 1 with common distribution S{1) and the last one 7^,1- 

The frequencies of rejection are reported in Tabled! where A = 0 corresponds to the observations 
generated according to the null hypothesis. Besides, Table [5| reports the size-corrected relative exact 
powers PHi{Tn,Ds,Tn,cL) as defined in ([7|). 
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1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

^(3) 

-^0 

0 . 080 * 
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0.290 
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0.996 

1.000 
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1.000 

1.000 

1.000 

^(4) 
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0.058 

0.078 
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0.094 

0.118 

1.000 

1.000 

1.000 

1.000 
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Hf' 

Hf' 
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0 

0.25 
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0.75 

1 

1.5 





rr(l) 

^0 

0.034 

0.044 

0.428 

0.984 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

^( 2 ) 

0.036 

0.982 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

^( 3 ) 

0.040 

0.052 

0.270 

0.870 

0.996 

1.000 

1.000 

1.000 

1.000 

1.000 

^(4) 

-^0 

0.054 

0.056 

0.126 

0.398 

0.766 

0.994 

1.000 

1.000 

1.000 

1.000 


Table 4: Frequency of rejection for the bootstrap test T„,cl and r„_DS for n = 200 and dimension p = 5 , a = 0 . 05 . 
★ indicates that the frequency of rejection is significantly different from the nominal level. 
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From Tabled! one observes that, except for the Pearson distribution, the observed level of the 
classical procedure, T^Ho(Tn,cL), is slightly higher than the nominal one. However, Tn ,cl leads to 
a liberal test only for the 7^^5(0,I) distribution. On the contrary, T^Ho{'^n,Ds) is smaller than the 
nominal level, except for the Cauchy distribution in which case, the exact size is close to 0.05. 
However, in none of the considered situations the observed frequencies of rejection are significantly 
different from the nominal level a = 0.05. For the Cauchy distribution, as expected, the classical 
test is non-informative when considering the alternatives On the other hand, both procedures 

detect the alternatives for j = 1,... ,4. Table [5] shows the advantage of the procedure based 

on r„^os over that based on Tn^cL, except for F^j^oos- inadequate behavior of Tn^cL for the 

Cauchy distribution shown by a power almost equal to the level for the alternatives is more 
clear when comparing the values of the size-corrected relative exact powers pH i {Tn,Ds,Tn, cl)- 



PHi (Ti.DS, Ti.Cl) 
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-^l.A 







0.25 

0.5 

0.75 

1 

1.5 


-64.286 

-32.765 

1.279 

2.766 

2.766 

2.766 

2.766 

2.766 

2.766 


-0.421 

1.048 

1.048 

1.048 

1.048 

1.048 

1.048 

1.048 

1.048 

^0 

-220.000 

9.524 

5.598 

4.367 

4.348 

4.348 

4.348 

4.348 

4.348 

^(4) 

-^0 

-90.000 

227.273 

1128.571 

1877.778 

1466.667 

0.425 

0.425 

0.425 

0.425 


Table 5: Size corrected relative exact power of the robust bootstrap test T„,ds with respect to the classical Tn, cl 
one, when n = 200, p = 5 and a = 0.05. 


6.3 Comparisons with other tests for elliptical symmetry 

Taking into account the better performance of the procedure based on Tn^us over that based on T„^cl 
and Tn^s, in this section, we compare the conditional test based on T„^os with some other methods 
found in the literature. The simulation conditions are similar to those described in Sections 16.11 
and 16.21 

As mentioned in Section O there is a wide literature on methods to test for elliptical symmetry. 
According to the simulation power studies performed in Huffer and Park (2007), none of the tests 
introduced in Manzotti et al. (2002), Schott (2002) or Huffer and Park (2007) is uniformly superior 
for detecting departures from the null hypothesis. On the other hand, Batsidis et al. (2014) also 
showed that their proposal is comparable in power to that defined in Schott (2002). 

The purpose of the numerical study in this section, is to show that the proposed test statistic 
is a useful option to the previously defined methods, in particular when moments do not exist. 
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Since there is no superior test statistic, we decided to choose for the comparison a test statistic 
which can be easily computed and has a tractable null distribution. For this reason, we excluded 
the test defined in Koltchinskii and Sakhanenko (2000) as well as the statistic defined in Beran 
(1979). With respect to the test proposed in Batsidis and Zografos (2013), their method helps 
to decide departures from a specihc elliptical model, while our procedure is designed to detect 
departures from the whole family of elliptical distributions. For this reason, it is not included in 
the comparison. 

On the contrary, the test defined by Schott (2002) is easy to compute since it is based on a 
fourth moment statistic denoted as Tn^scn- Besides, this statistic is asymptotically where 
depends on the dimension of the data, but not on the underlying null elliptical distribution. Based 
on the simulation studies reported in Schott (2002), Buffer and Park (2007) and Batsidis et al. 
(2014), the test based on T„^sch has observed level close to the nominal one and good empirical 
power. Besides, as our procedure, the test statistic is affine invariant. It is worth noting, that the 
asymptotic behavior of Tn^scu is derived for distributions having finite moments up to order eight 
so that it will be sensitive to departures from this assumption, even if the distribution is elliptical. 

We also include in the comparison the test statistic, T„,bat; recently introduced in Batsidis 
et al. (2014) that is based on a power divergence family of statistics depending on a parameter 
A. According to the simulation results in Batsidis et al. (2014), we select A = 1 (similar results 
were obtained for A = 2/3). As noted by these authors, Tn, bat has approximately a chi-square 
distribution with degrees of freedom depending on the sample size and may be computed in a 
simple way. As mentioned in Batsidis et al. (2014), this test statistic has a very good power for a 
variety of alternatives, even when it is not affine invariant. 

The observed frequency of rejection and the size corrected relative exact powers for T^^bat are 
given in Tables [6] and [7] for p = 2 and in Tables [8] and [U when p = 5. Analogous quantities for 
Tn,scH are reported in Tables [TOl and fTTl for p = 2 and in Tables [12] and [13] for p = 5. 

As expected, when the underlying distribution has no moments, the test based on Tn,sen becomes 
non informative, since it relies on the assumption of existence of eight order moments. The same 
happens when the data are generated according to the multivariate Student distributions 72 , 3 ( 0 , 1 ) 
and 75 , 5 ( 0 ,1) since these distributions do not satisfy the assumptions in Schott (2002). When p = 2, 
in the four other situations, even if the test reaches in most cases the desired level, it has difficulties 
to detect the selected alternatives, specially the alternatives and In dimension p = 5, 
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-^0 
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^(7) 

-^0 

0.926* 

0.892 

0.904 

0.892 

0.456 

1.000 

0.974 

0.976 


Table 6: Frequency of rejection for the test defined in Batsidis et al. (2014) for n = 200 and dimension p — 2 with 
A = 1. * indicates that the frequency of rejection is significantly different from the nominal level. 
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-57.113 

-56.900 

rrC3) 

-^0 

-83.168 
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-3.758 
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-15.756 

1.891 

1.891 

-55.252 

1.891 
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-0.630 

1.(7) 

-^0 

-288.889 

-105.289 

-104.337 

-149.370 

92.227 

-94.958 

-94.748 


Table 7: Size corrected relative exact power for test based on the statistic T„,bat defined in Batsidis et al. (2014) 
with respect to the robust bootstrap test Tn,uB for n = 200 and dimension p = 2 with A = 1. 
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0.888 
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^( 3 ) 

-^0 

0.036 
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0.050 
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0.516 

0.972 

0.936 

1.000 

0.888 
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^( 4 ) 

-^0 

0.800* 

0.804 

0.798 

0.808 

0.810 

0.846 
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1.000 

0.888 

0.990 


Table 8: Frequency of rejection for the test defined in Batsidis et al. (2014) for n = 200 and dimension p — 5 with 
A = 1. * indicates that the frequency of rejection is significantly different from the nominal level. 


for the Pearson distribution, the level and power performance of Tn,sen is analogous to that of our 
procedure, while for the normal distribution, the test based on rn,Ds has a better detection power 
except for A = 0.25. For the chosen fixed alternatives, , j = 3,4 our procedure has much 
better power. 

With respect to the proposal given in Batsidis et al. (2014), the test based on rn,BAT becomes 
non-informative when contaminating with a Cauchy distribution or when the data follow a Cauchy 
distribution, which can be explained by the fact that the test is based on the sample mean and the 
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PHi [Tn 

,BAT, Tn 

,DS) 







^ l.A 




Hr 

Hr 

nr 

A 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

- 60.000 

- 91.371 

- 83.158 

- 37.474 

0.621 

- 5.797 

0.828 

- 10.766 

- 0.207 

^( 2 ) 

-^0 

- 93.658 

- 1.452 

1.660 

1.660 

1.660 

- 4.979 

1.660 

- 9.959 

0.622 

77 - C 3 ) 

-^0 

- 100.000 

- 93.913 

- 79.036 

- 49.791 

- 2.500 

- 6.250 

0.417 

- 11.250 

- 0.625 

7 ^( 4 ) 

-^0 

100.000 

- 102.778 

- 97.674 

- 98.596 

- 95.106 

- 85.624 

- 78.858 

- 90.698 

- 79.915 


Table 9: Size corrected relative exact power for test based on the statistic T^.bat defined in Batsidis et al. ( 2014 ) 
with respect to the robust bootstrap test Tn,us for n = 200 and dimension p = 5 with A = 1 . 






Tn 

,SCH 







rr ( 7 ) 

-^ l.A 


nr 

A ' l ) 

Hf 

nr 


A 

0 

0.5 

1 

1.5 





T / U ) 

-^0 

0.032 

0.074 

0.218 

0.312 

0.548 

0.282 

0.140 

0.864 

7 ) 7 ( 2 ) 

-^0 

0 . 504 * 

0.490 

0.444 

0.394 

0.548 

0.282 

0.140 

0.864 

rr(3) 

-^0 

0.054 

0.050 

0.182 

0.296 

0.548 

0.282 

0.140 

0.864 

7)r(4) 

-^0 

0 . 098 * 

0.094 

0.094 

0.158 

0.548 

0.282 

0.140 

0.864 

rr ( 5 ) 

-^0 

0.064 

0.112 

0.310 

0.380 

0.548 

0.282 

0.140 

0.864 

rr( 6 ) 

-^0 

0.054 

0.276 

0.382 

0.410 

0.548 

0.282 

0.140 

0.864 

rr(7) 

-^0 

0 . 396 * 

0.394 

0.410 

0.432 

0.548 

0.282 

0.140 

0.864 


Table 10: Frequency of rejection for the test defined in Schott (2002) for n = 200 and dimension p = 2. * indicates 
that the frequency of rejection is significantly different from the nominal level. 
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77(3) 

-^l.A 



nr 

Hf’ 

_^( 4 ) 

Hf 

A 

0.5 

1 

1.5 





-^0 

- 79.808 

- 80.255 

- 70.588 

- 45.798 

- 73.740 

- 88.656 

- 12.605 

7j(2) 

-^0 

- 106.222 

- 106.480 

- 111.677 

- 95.329 

- 123.567 

- 138.641 

- 61.783 

77-(3) 

-^0 

- 101.980 

- 86.412 

- 74.739 

- 48.434 

- 76.200 

- 91.023 

- 15.449 

7)r(4) 

-^0 

- 102.667 

- 100.496 

- 93.631 

- 52.331 

- 80.509 

- 95.551 

- 18.856 

77^(5) 

-^0 

- 91.209 

- 73.774 

- 66.311 

- 48.401 

- 76.759 

- 91.898 

- 14.712 

77^(6) 

-^0 

- 76.681 

- 65.546 

- 62.605 

- 48.109 

- 76.050 

- 90.966 

- 14.916 

M 7 ) 

-^0 

- 111.111 

- 96.635 

- 95.408 

- 84.034 

- 111.975 

- 126.891 

- 50.840 


Table 11: Size corrected relative exact power for test based on the statistic Tn.scu defined in Schott (2002) with 
respect to the robust bootstrap test T„,ds for n = 200 and dimension p — 2. 


sample covariance matrix. On the other hand, for the number of replications considered the level 
0.05 is not attained for the normal distribution and for data uniformly distributed over the unit 
circle, in dimension p = 2. This fact was also observed in Table 10 of Batsidis et al. (2014) for 
the normal distribution with 10000 replications. Table [7] shows that, as for the Schott’s test, our 
method outperforms the procedure based on T^^bat in the majority of the cases considered. Similar 
conclusions for the considered distributions and alternatives are obtained in dimension p = 5, as 
shown in Table [H 
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,SCH 









-^l.A 




Hf’ 

Hf’ 

nr 

A 

0 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

0.040 

0.064 

0.182 

0.550 

0.790 

0.900 

0.864 

0.872 

0.314 

0.516 

^(2) 

-^0 

0.056 

0.468 

0.900 

0.944 

0.954 

0.964 

0.864 

0.872 

0.314 

0.516 

^(3) 

-^0 

0.054 

0.046 

0.052 

0.150 

0.410 

0.800 

0.864 

0.872 

0.314 

0.516 

^(4) 

-^0 

0.974* 

0.974 

0.974 

0.974 

0.970 

0.946 

0.864 

0.872 

0.314 

0.516 


Table 12: Frequency of rejection for the test defined in Schott (2002) for n = 200 and dimension p = 5. * indicates 
that the frequency of rejection is significantly different from the nominal level. 
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Tn,SCH,Tn 
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rr(j) 

^l.A 




Hf’ 

Hf’ 

, (4) 

m 

A 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

140.000 

-63.959 

-46.316 

-22.360 

-10.973 

-14.700 

-13.872 

-71.636 

-50.725 

7:7(2) 

-^0 

-56.448 

-12.448 

-7.884 

-6.847 

-5.809 

-16.183 

-15.353 

-73.237 

-52.282 

7(7(3) 

-^0 

-166.667 

-100.870 

-88.434 

-62.762 

-22.292 

-15.625 

-14.792 

-72.917 

-51.875 

rr(4) 

-^0 

-100.000 

-100.000 

-100.000 

-100.562 

-102.979 

-111.628 

-110.782 

-169.767 

-148.414 


Table 13: Size corrected relative exact power for test based on the statistic Tn.scH defined in Schott (2002) with 
respect to the robust bootstrap test Tn.ns for n = 200 and dimension p = 5. 

These facts are highlighted in Tables [11] and [T3| that report the size corrected relative powers 
PHi (Tn,scHj Tn,Ds)- The negative values reported in all cells confirm the better performance of Tn^m- 
Note that even if, for the Cauchy distribution, the test proposed in Batsidis et al. (2014) has a 
positive value when A = 0.25 and p = 5, the test is non-informative having power almost constant 
for Hence, in this case, the size corrected size does not provide a good measure to compare 

the test statistics. 

It is worth noticing that our conclusions regarding the better performance of the test based on 
the Donoho-Stahel estimators are valid only for the considered distributions and alternatives. A 
more extensive simulation study would be necessary to conclude that, in general, Tn^us should be 
preferred. This interesting comparison may be object of future work. 


6.4 Simulation study in dimension p = 5 with n = 50. 

In Section (5] we noticed that for small sample sizes the distribution of the test statistic does not 
allow to distinguish the elliptical distributions from those considered in the alternative. For that 
reason, in the simulation study reported in Sections 16.II and 16.21 we choose as sample size n = 200. 
To complement the results obtained in Section 16.21 and to study the effect of a smaller sample size 
on the decisions taken, we report here the observed frequencies of rejection for the test based on 
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the sample mean and covariance matrix, T^^cl and for that based on the Donoho-Stahel estimators, 
Tn,us, when p = 5 and n = 50, which represents a challenging situation due to the ratio between 
sample size and dimension. As in Section [6.31 we also compare their performance with that of the 
test statistic, T„,bat) introduced in Batsidis et al. (2014) and with the test defined by Schott (2002), 
Tn,sen- The simulation conditions as well as the considered alternatives are described in Section 


The corresponding frequencies of rejection are reported in Tables [TTl [15] and [T6l where A = 0 
corresponds to the observations generated according to the null hypothesis. Besides, Table [I7| 
reports the size-corrected relative exact powers p HiiTn,us, Tn, cl) as defined in (7), while Tables [TSi 
and [T^ report the size corrected relative powers PHi{Tn, bat, Tn,us) and (T„ gen, T„ ^s). Note that 
a positive value of size corrected relative power PHi{Tn,i,Tn^ 2 ) indicates that the test based on Tnp 
has a better detection capability than that based on and the size of its advantage is quantified 
by PHi {Tn,i,Tn, 2 )- Similarly, a negative value of pHi {Tn,i,Tn, 2 ) provides a measure of the deficiency 
of Tnp with respect to T„,2- 
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rrO) 

-^ l.A 



Ht"’ 

Hf’ 

Hf’ 

- u ( 4 ) 

Ht 

A 

0 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

0.054 

0.068 

0.180 

0.528 

0.844 

0.988 

1.000 

1.000 

1.000 

0.998 

^( 2 ) 

-^0 

0.044 

0.440 

0.984 

1.000 

1.000 

1.000 

1.000 

1.000 

1.000 

0.998 

^( 3 ) 

-^0 

0.128* 

0.134 

0.186 

0.370 

0.652 

0.940 

1.000 

1.000 

1.000 

0.998 

^( 4 ) 

-^0 

0.228* 

0.230 

0.240 

0.276 

0.290 

0.362 

1.000 

1.000 

1.000 

0.998 
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-^ 1,A 




Hf’ 

Hf’ 

- u ( 4 ) 

Ht 

A 

0 

0.25 

0.5 

0.75 

1 

1.5 





rr(l) 

^0 

0.052 

0.050 

0.086 

0.236 

0.542 

0.898 

0.992 

1.000 

0.972 

0.980 

^(2) 

-^0 

0.060 

0.204 

0.858 

0.992 

1.000 

1.000 

0.992 

1.000 

0.972 

0.980 

^( 3 ) 

-“0 

0.068 

0.050 

0.090 

0.190 

0.360 

0.774 

0.992 

1.000 

0.972 

0.980 

^( 4 ) 

-^0 

0.044 

0.036 

0.048 

0.098 

0.150 

0.330 

0.992 

1.000 

0.972 

0.980 


Table 14: Frequency of rejection for the bootstrap test Tn, cl and r„,DS for n = 50 and dimension p = 5, u = 0.05. 
* indicates that the freqnency of rejection is significantly different from the nominal level. 


Table dH shows that, even for this small sample size, the robust procedures allow to detect 
the considered alternatives keeping the exact sizes of the test statistic, i.e., T^Ho{Tn,us) around 
the nominal level a = 0.05. As in Section 16.21 and 16.31 we indicate with a -k those cases, in 
which the observed empirical frequencies of rejection are different from the nominal level with a 
significance level 7 = 0.01. As expected, the test based on the sample mean and covariance matrix 
outperforms that based on the Donoho-Stahel estimators under the normal distribution due to the 
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,BAT 









-^1,A 




Hf’ 

Hf’ 

nr 

A 

0 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

0.016* 

0.016 

0.030 

0.040 

0.084 

0.248 

0.152 

0.302 

0.144 

0.274 

^(2) 

-^0 

0.016* 

0.036 

0.186 

0.440 

0.544 

0.640 

0.152 

0.302 

0.144 

0.274 

^(3) 

-^0 

0.016* 

0.024 

0.036 

0.036 

0.074 

0.184 

0.152 

0.302 

0.144 

0.274 

^(4) 

-^0 

0.438* 

0.408 

0.362 

0.374 

0.352 

0.324 

0.152 

0.302 

0.144 

0.274 


Table 15: Frequency of rejection for the test defined in Batsidis et al. (2014) for n = 50 and dimension p = 5 with 
A = 1. * indicates that the frequency of rejection is significantly different from the nominal level. 
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-^l.A 




Hf’ 

Hf’ 

. ( 4 ) 

hT 
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0 

0.25 

0.5 

0.75 

1 

1.5 





-^0 

0.048 

0.034 

0.034 

0.076 

0.142 

0.312 

0.238 

0.258 

0.074 

0.708 

77 ^( 2 ) 

-^0 

0.060 

0.074 

0.270 

0.432 

0.514 

0.586 

0.238 

0.258 

0.074 

0.708 


0.038 

0.036 

0.044 

0.060 

0.084 

0.182 

0.238 

0.258 

0.074 

0.708 


0.964* 

0.966 

0.966 

0.956 

0.948 

0.900 

0.238 

0.258 

0.074 

0.708 


Table 16: Frequency of rejection for the test defined in Schott (2002) for n = 50 and dimension p — 5. * indicates 
that the frequency of rejection is significantly different from the nominal level. 


loss of efficiency of the robust estimators. The advantage of T„,cl is also observed for the Pearson 
distribution, in particular, when A = 0.25 and A = 0.5. These two facts are consistent with the 
behaviour described in Section [5l where for dimension p = 2 and n = 50, the distribution of the test 
statistic has troubles to distinguish between the null hypothesis and close alternatives for most of 
the considered elliptical distributions. Note that the test based on the sample mean and covariance 
matrix becomes liberal under 7^^5(0,I). Besides, the test based on the robust estimators shows its 
advantage for this distribution, except for A = 0.25 where Tn^os does not succeed in detecting the 
hypothesis, leading to a large negative value on the size corrected relative power. On the other 
hand, under the Cauchy distribution Tn ,cl is non-informative, while Tn ,ds is able to distinguish all 
the alternatives except when A = 0.25 and 0.5. These facts become more evident in Table flTl where 
most size corrected relative powers are positive for distributions different from the normal. The 
large negative value obtained at A = 0.25 for the Cauchy distribution can be explained by means 
of two facts. The first one is that rn,cL has power almost constant, so that the denominator is close 
to 0, while the second one is that the power of Tn^ns decreases at A = 0.25 with respect to its size. 
Note that, given two test statistics and Tn^ 2 , when the test based on is non-informative, a 
negative value of the size corrected relative power PHi (7n,i) 7n,2) does not provide a good measure 
to conclude the benehts of Tn ^2 over 
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With respect to the test statistics, T„,bat and T^^sch, both procedures loose their capability of 
detection under the Cauchy distribution, since their behaviour relies on the existence of moments. 
On the other hand, the test statistic proposed in Batsidis et al. (2014) is conservative in all situa¬ 
tions except for the Cauchy distribution where it is liberal. The same conclusions obtained 

when n = 200 are preserved in the actual setting, that is, the procedure proposed in this paper 
outperforms these competitors in the majority of the situations considered. As mentioned in Sec¬ 
tion [6]3j our conclusions on the benefits of T„^os are valid only for the considered distributions and 
alternatives. Quite surprisingly, even for this small sample size the procedure based on T^^ds shows 
a reasonable performance probably due to the bootstrap method used to compute the p—value. 
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H^j) 

^l.A 




nr 



A 

0.25 

0.5 

0.75 

1 

1.5 






-114.286 

-73.016 

-61.181 

-37.975 

-9.422 

-0.634 

0.211 

-2.748 

-1.695 


-63.636 

-15.106 

-2.510 

-1.674 

-1.674 

-2.510 

-1.674 

-4.603 

-3.564 

rrC3) 

-^0 

-400.000 

-62.069 

-49.587 

-44.275 

-13.054 

5.963 

6.881 

3.670 

4.828 

^(4) 

-500.000 

-66.667 

12.500 

70.968 

113.433 

22.798 

23.834 

20.207 

21.548 


Table 17: Size corrected relative exact power of the robust bootstrap test Tn,DS with respect to the classical Tn, cl 
one, when n = 50, p = 5 and a = 0.05. 
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Hf’ 

_^(4) 
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0.25 

0.5 

0.75 
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1.5 





rrUJ 

-^0 

-100.000 

-58.824 

-86.957 

-86.122 

-72.577 

-85.532 

-69.831 

-86.087 

-72.198 

rr(2) 

-^0 

-86.111 

-78.697 

-54.506 

-43.830 

-33.617 

-85.408 

-69.574 

-85.965 

-71.957 

rr(3) 

-^0 

-144.444 

-9.091 

-83.607 

-80.137 

-76.204 

-85.281 

-69.313 

-85.841 

-71.711 

rr(4) 

-^0 

275.000 

-2000.000 

-218.519 

-181.132 - 

139.860 

-130.169 

-114.226 

-131.681 

-117.949 


Table 18: Size corrected relative exact power for test based on the statistic Tn, bat defined in Batsidis et al. (2014) 
with respect to the robust bootstrap test Tn.us for n = 50 and dimension p = 5 with A = 1. 
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-^0 

600.000 

-141.176 

-84.783 

-80.816 

-68.794 

-79.787 

-77.848 

-97.174 

-28.879 

^(2) 

-^0 

-90.278 

-73.684 

-60.086 

-51.702 

-44.043 

-80.901 

-78.936 

-98.465 

-29.565 

^(3) 

-^0 

-88.889 

-72.727 

-81.967 

-84.247 

-79.603 

-78.355 

-76.395 

-96.018 

-26.535 

^(4) 

-^0 

-125.000 

-50.000 

-114.815 

-115.094 

-122.378 

-176.582 

-173.849 

-195.905 

-127.350 


Table 19: Size corrected relative exact power for test based on the statistic Tn, sen defined in Schott (2002) with 
respect to the robust bootstrap test Tn,BS for n = 50 and dimension p = 5. 
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Appendix A: Numerical computation of the test statistic 


The defined test statistic r„,m,v involves an integral that may be calculated numerically. In dimen¬ 
sion 2, the approximation described below is easy to perform. Assume that w has compact support 
I = [—6, b] and split it in a grid of Nx points tj. We consider M random directions {aj}^^ in Sp 
generated according to a uniform distribution on the sphere. Once the robust estimates m„ and 
are obtained from the sample, for each ti on the grid and each generated random direction aj, 

w(ti). Then, we approximate the desired 
test statistic by 2bsur{Sp) hj where sur{Sp) denotes the surface area of the 

sphere in of radius 1. 

To get an alternative expression for the test statistic, we will restrict our attention to the 

_1 /Q 

situation where w{t) = ;,](t)/(26). Let Zi{a) = a'^Vn ' (X, — m^), then 


we compute lij = | y/nPn sin 


Ua^vJ^^ (X-m„) 


j {^/nPn {sin {tZi{a))})'^ w{t)dt 


1 

2bn 


_^ rb 

/ sm(tZi{a))sin{tZj{a)) dt . 


Using that sin(x) sin(?/) = (cos(x — y) — cos(x -|- y))/2 and denoting U{{j{a) = Zi{a) + Zj{a) = 
(Xi + Xj - 2m„) and [7^ (a) = Zi{a) - Zj{a) = (X^ - X^), we get that 



w{t)dt = 


1 sin [bU-^ (a)) sin {bU^- (a)) 

u±{^) ’ 


which implies that 


^n,m,V — 


— E 

2n ^ 






sin (X,-Xj 

(X, - X,) 


- EiY, 


sin (b (Xi -b X^ - 2m, 


6UTV„^/^(Xi + X.-2m„ 
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where U = {Ui ,..., Upf ~ U{Sp) = Up. Define = h^(Xj - /i, —(Xj - /i), V) and = 

(P{'X.i — fi,'Kj — where (i^(x, v, S) = (x —v)"^S“^(x—v) is the squared Mahalanobis distance. 

Since U ~ U{Sp), we have that Y = {p — 1) 2 ~ (see Muirhead, 1982, 

pp.38) and Ui = Yj\/V — 1 + y^. Moreover, since U^v f7i for any ^ Sp have that 
(X, - Xj) ~ and (Xj + X^ - 2m„) ~ These facts entail that 


where the function / : M —^ M is dehned as 


!{n) = 


( sin 1 u 


Y 


^p-l+Y2 


Y 




Vp-^+y" 


U 


- 


sin (u Ui) 
uU^ 


with sin(rt)/u = 1 if u = 0. Then, using that D-j = D--, Df- = D^-, = 0 and /(O) = 1, we get 

a simpler expression for the test statistic given by 


T)i,m,V 



1-/ 


bD± 


+ 2E[/(' 


bD 




j=l t i=l ) 

To summarize, in order to compute the test statistic, the user only has to evaluate the function / 
by Monte Carlo over a grid of points. 


Appendix B: Proofs of Theorems [3].l and [3]. 2 


Proof of TheoremOI. Using that sin(x—?/) = sin(x) cos(?/)—cos(x) sin(y) and since a'’^Vn^^^(X— 
m„) = a'’^Vn^^^(X — p) — — /.i), we have that -y/nP^sin ta'^Vn— m„) = 

V^Wi^n{t,a.) - y/nlT 2 ,n(Ta) where 


lUgn(t,a) = cos 

ta'^V„^/^(m„ - n) 

Pn sin 

taTV-V2(X-/.)' 

W 2 ,n{t,aL) = sin 

ta'^V~^/2(m„ - n) 

Pn COS 

ta^V-V2(X-/.)' 


Denote as a, A) = cos [ta^A(X —/x)] and ^(t,a. A) = Ep cos [ta^A(X —/x)]. Note that 
since X ~ Tp(/i, S, i/'), we have that C{t, a, A) = -(/^(t^a^ASA^a) which entails that C{t, a, = 
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The Dominated Convergence Theorem implies that 


lim Ep sup 

tGR,ae5p 


COS 


[ta"^A(X — /^)] — cos (ta^Ti — fj, 


= 0 , 


which, together with the fact that V„ S, entails that sup^g^ ,|C(t,a,V„'/')-C(t,a,X-V2)| Ji 
0, that is, 


sup 

tGM,aG<Sp 


Ep cos 


ta^V-V2(X-/x) 



(B.l) 


P m 

Let II • lls be a norm in the space Ad of symmetric positive definite matrices. Then, as V„ —)■ X, 
we have that for n large enough with high probability, ||Vn^'^^ — X~^/^||s < 1. Let J- = {/(x) = 
cos(ta"^A(x — fi)), t S X,a S Sp,A £ : ||A — X“^/^||s < 1} and Gc = {/b(x) = cos(b'^(x — 

/^)), b £ ; ||b|| ^ Then, for some C ^ 0, c£ Gc' Xote that the functions y*] 3 (x) are such 

that the map b i—)■ /b(x) is continuous for all hxed x and its envelope T(x) = sup|n,||<c l/b(x)| 
satisfies that F £ Li{P) since T < 1. Hence, using Lemma 3.10 in van de Geer (2000), we have that 
the class Gc has finite bracketing number N^j{e,Gcj Li{P)), which entails that Gc is a Glivenko- 
Cantelli class of functions, that is, sup|H 3 ||<(^ \Pn cos [b'^(X — /x)] — Pcos [b'^(X — /x)] | 0. This 

convergence implies that sup^gp .jG<Sp I Zn{t,a,Vn^^‘^) - 0. Hence, using (IB.ip . 

we get that 

sup |Z„(f,a,V-^/2)-^(t2)| ^0. (B.2) 

^ 3 .GSp 

From the fact that lim,j_>,o sinu/u = 1 , the consistency of and m„, we get that 


sup 

tGl,aG<Sp 


sm 


fa'^Vn ^ (m„ - /x) 


- /x) 


T - 1 


0 . 


(B.3) 


Using that ^/n{nln — /x) = Op(l), together with ()B.2h and (IB.3p . we conclude that 


sup 

tGl,aG<Sp 


VnW2,n{t,a.) - ^p{P) ta'^X ^/n{nln - fi) 


which together with the fact that m„ admits a Bahadur expansion leads to 

1 ^ 

v^W^ 2 ,n(La) = ^(t")ta^X-i/2 J^(Xi-/x) _ ;^) ||) + a) , (B.4) 

^ 2=1 


where sup^gp^^g^^ \R 2 ,n{t,a)\ 0. 
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Similarly, using that | cos(u) — 1| < |u|, that X is a bounded interval and the consistency of 
and m„, we obtain that 


sup 

t€X,a&Sp 


cos 


- At) 

Hence, the proof will be complete if we show that 


- 1 


0 


(B.5) 


i) ^ Ws^n{t, a) = y/nPn sin ta'^S 


converges to a Gaussian process and 


b) supiejag 5 ^ ^/nPnSin ta'^Vn-/j) - i/nP„ sin ta'^X ^/^(X-/^) 


0 . 


_ 1 /Q 

Effectively, if a) and b) hold the process ^/nPn sin ta^Vn ' (X — /j) is tight, so, using (|B.5p . we 
can write v^VEi,„(t,a) = v^VF 3 ,„(t,a) +i2i,„(t,a) , where sup^^j a)| 0, which 

together with (jB.4p leads to 


V^P^sin ta'^V„^/^(X - m„) = ta^T, ^ (X* -/r) Om (||X (Xj -/j) 

i=l 


+^/nPnSin ta^Yl ^/^(X —/r) +P„(t,a), 

where sup^gj^ag^^ \Rn{t,a)\ 0. 

The proof of a) follows from Ghosh and Ruymgaart (1992), so it only remains to show b). 

To derive b), denote Z*{t, a. A) = Pn sin [ta'^A(X — /r)]. Then, we have that EpZ*(t, a. A) = 0, 
since X ~ Tp(/x, S, V’)- Note that F* = {/(x) = sin(ta^A(x — /x)), t G X, a G Sp, A G : || A — 
< 1} C Gq for some C > 0 where = {/b(x) = sin(b^S“^/^(x—/x)), b G : ||b|| < C}. 
In the Appendix C, it is shown that G^ is Donsker, which entails the uniform equicontinuity leading 
to b). □ 


Proof Theorem [3l 2. As in Theorem [3ll, let yVn{t,a) = y/nPn sin ta^V^ (X — m„ x 
We will show that the process Wn = a) , (t, a) G X x 5p} converges in distribution to the 

Gaussian process W* = {>V*(t,a) , (t, a) G X x Sp} with >V*(t, a) = Wit, a) + {1/R)Bi{t,a). 

As in the proof of Theorem[3ll, we have that >Vn(t,a) = ^/nWi^nit^a) — y/nW2^n{t,a) where 


Wi^nit.a) 

W2,n{t,a) 


= cos 


= sm 


(“n.X - R) 
(™",X - R)) 


Pn sin 
Pn COS 


(X-/x)) 

(X-/x)) 
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Besides, we also have that 


sup 


sup 

t^'X^di^Sp 


cos 




n.X 


n,X 


Sin 


ta^V, 


n,x ' (nin,5 


X^ (“ln,X - 


-M)) 

- ^^)) 

/^)) 



(B.6) 

(B.7) 


hold since V„^x S and m„ x ~ M 0- 0^^ the other hand, from ([6|) and nsing that m„_z 
satisfies ([31) together with the fact that V„^x we get that 



(m„,x - 


/^) 


1 

-j= Y, (Zi - p) (||S-V2 (z, - p) ll) + 


(B.8) 


where Rn 


0. Hence, if we show that 


snp 

teX,a€Sp 


Pn COS 





(B.9) 


we obtain the following expansion for y/nW 2 ^n{t,Si) 


VnW2,n{t 


1 ^ 

a) = (Zj - p) am f (Z^ - p) ||) + fi2,n(t,a) 

^ 2 = 1 


where snp^gx^ag^^ |ii 2 ,n(t,a)| ^ 0. 

To obtain (IB.9D . notice that from the proof of Theorem [31 1, we have that 


snp I Pn cos 

t^X ^3.^Sp 


taTV„j/'(Z-/x)l -V.(t2)| J4o 


(B.IO) 


since Z ~ Tp(/i, 5], i/^) and V„^x 51. Besides, using that | cos(u) — cos(u)| < |u — ?;| and 
Xi = Zj + Yj/n“, we get the bound 


snp I Pn cos 

t^X .,3.^Sp 


(X-p) 


— Pn COS 




< 

' 'n m 


— '' 72 ,max 


■PnllYll 




where A^^max stands for the largest eigenvalne of the matrix x- Therefore, using that E|| Y|| < oo 
and V„_x El, we get ()B.9I1 from (IB.IOD . 

Denote as a. A) = P„ sin [ta"^A (X — /x)] and a. A) = sin [ta"^A (Z — /x)]. 
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Then, noting that Wi^n{t, a) = 
that ^/nWl^n{t,SL) = 


■■ cos 


a 


(™n,x - a, from p.6p . we obtain 

) + i?i,n(t,a), with supjgi^aGSp l^i,n(i,a)| ^ 0, if 


^ sup |iV4,x(i,a,V„^^')| =Op(l) (B.ll) 

t&X,a&Sp 

holds. 

Recall that, from the proof of Theorem [311, ^/nMn,zit,8L,Y^^ ) converges to a Gaussian process 
since V„^x Let 5 be such that ||A — S“^/^||s < 6 entails that |Amax(ASA^) — 1| < 

6o/h' where 6o = mini<m<£((5m) and 6m are given in assumption f). Denote T = {/(y, z) = 
(ta'^Ay)^ sin(^)(ta'^A(z — /x)), {{t, a, A) e A}, where A = {(t, a. A) : t S X, a £ Sp, || A — < 

6}. Then, the proof will be completed if we show the following convergences 


sup 

(t,a, A)G-4 


\/ra{A/„,x(t,a, A) - Mn,z{t,a, A)} - | (ta'^AY)^ sin(^)(fa'^A(Z - /i 


sup|P„/(Y,Z)-P/(Y,Z)| ^0, 


^ 0, (B.12) 
(B.13) 


for some 6 small enough, since (IB.12li and (IB.13P entail (IB.lip and also the desired expansion. For 
any 1 < s < i, define 


Gs = {5b(y, z) = (b^S-VSy)* sin(^)(bTS-V2(2 _ b £ : ||b|| < z/ + <5o} . (B.14) 


The proof of ()B.13p follows using Lemma 3.10 of van de Geer (2000) and similar arguments to those 
considered in proof of Theorem [3l 1 applied to the classes of functions T and since T <Z Qi and 
the envelope G(y,z) = sup||b||<c l5'b(y,z)| < C'^||y||^ S L^{P) with C = {v A 5o)Amax(S"^). 

It only remains to prove ()B.12h which follow if we show that sup|n,j|<,^+ 5 |j |Yn(b)| 0 where 


Yn{h) = Vn{RnSin b'^S“L2 (x _ -Pnsin (Z -/i) | 


-Ip 




■L2Yysin(^)(b'^S"L2(z 


Using a Taylor’s expansion and noting that — fx = (Zj — fx) + Yijn^ and a = l/{2t} we have 
that (l/n)X)(Li sin b'^E’^a -/i) = (1/n) sin b'^S“L2 (z. _ Si^n +S 2 ,n + 
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where 


51, n 

5 2 , n 

53, n 


1 A 1 1 


e -1 


i=l s=l 


sin(^)(b^S -/^)) = ^ ^ ^ 5i,„, 

n s! n“^ L J s! n“* 


S = 1 


1 1 1 


n"^ i\ n 


^ sinW(bTS-^/2(2. _ 


2=1 


n"(^+i) (f + 1)! n ^ 
^ ' 2=1 




with = 6'„b'^S“^/^(Zi - /^) + (1 - 6'n)b'^S“^/^ (X* - /i)), for some e (0,1). Thus, yn(b) = 
\Ai('S'i,n + 'S's^n) so to couclude the proof, we only have to show that \Aj'Sup|n,||<^jj_,_j, |S'j^n| 0, 

for j = 1, 3. Using that ia = 1/2, we get that, for any ||b|| < 5q + v, 


Iv^'S's^nl < 


1 1 

nh (^ + 1)!' 


b^S 


~1/2y^ 


l+l 


< 


1 1 

^ (^ + 1)! 


C^+^P„||Y|K+^ 


where C = (i^ + 5o)Amax(X ^), which entails that ^/nS3^n 0 since E||Y||^'''^ < oo. 

To obtain that ■\/nsup||)-,||<c |<S'i,n| 0, it is enough to show that, for any 1 < s < ^ 

-y/nsup||ij||<c |<S'i,n,sl/?T'“^ 0 which will follow if we prove that 


- 1 , 


Vn sup = Op(l). 

l|b||<C 


(B.15) 


Note that by assumption f), for any s < ^, E | b"^S sin^®^(b'^S — / 2 ))| = 0, when 

||b|| <60 + 12 . Thus, EiSi^n^s = 0 holds, so (jB.15jl follows from the fact that the class Qs defined in 
()B.14(1 is Donsker which is derived in the Appendix C. □ 


Appendix C: Proof that = {/b(x) = sin(b'^E — /r)), b G : 


|b|| < C} and Qg defined in (IB. 141) are Donsker 


When Ep||X — /^|p < 00 , the result follows easily from Lemma 2.5 in van de Geer (2000) and 
Theorem 2.7.11 in van der Vaart and Wellner (1996). However, since we do not assume this moment 
condition, we have to work more carefully and we will use the fact that \/log Hudu < 00 . 
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To provide a unified proof, denote as 


= { 5 'b(y,z) = (b'^S ^/^y)®sin(^)(b'^S -/i)), b e : ||b|| < C} . 

Hence, when s = 0, = Qq while for C = u + 60 we get the class defined in ()B.14p . It is then, 

enough to show that Gs is Donsker when ^/log^Judu < 00 if s = 0 or if Y and Z are independent 
while, if 1 < s < ^ and Y and Z are not independent we will use that for any 1 < s < i, 
fg \/log Huidu < 00 , where q = ij{i — s), sini^i(ti) = (—l)^sin(ti) or sini^i(u) = (—1)^+^ cos(u) 
and that E||Y|p^ < 00 . 

For simplicity, denote Bp{h,6) = {u : ||u - b|| < d}, Bp{6) = Bp{0,6), ||/||l 2 (p) = (Ep/2)V2^ 
A = and H = 2A^ {3 (E||Y||2«)^/' + (E||Yf^)^}. 

For any fixed p > 0 and /3 G Bp{C), define T(y,z,/3,/9) = sup{bGBp(c)nBp(/ 3 ,p)} bb(y, z) - 
5 ^(y, z)|. Note that the continuity of the sinus entails that the supremum can be taken over Q^, so 
that ^(x,/3, p) is measurable for each /3 and p > 0. Note that | sini®i(rt) — sin^® >wi < I u — v\ entails 
that | 5 bi(y,z)- 5 'b 2 (y,z)| < A®||y||®max(||bi - b 2 ||, ||bi - b 2 ||®) |l + C'^||S:"^/2(z - /i)||| . Then, 
if5=||5]-V2(Z-;.)|| we obtain the bound 

EpT2(Y,Z,/3,p)I[o,m](S) <E||Yf^A2«{l + C*M}Vax(p2,p2.) ^ (^.1) 

On the other hand, since | 5 'bi(y)Z) — 5 'b 2 (y)Z;)| < A^||y||^ {||bi — b 2 ||^ + 2(7®}, we have that 
'I'^(Y, Z,/3, p) < A^® {/9^® + 2(7®}^ II Y|p® and so, using the dominated convergence Theorem we 
get that Ep'I'^(Y, Z, /3, p) —)• 0 as /? —)■ 0. 

For a given 0 < e < min(l,H), let rj = e/A, choose as the smallest value such that P(5 > 
Afe) = P(||S~^/^(X — / 2 )|| > Mf:) < and define = min{ry,r//[(7®Afg]} < 1. Then, 

since max (/9g,/9g®) = pi, we get that 

EpT2(Y,Z,/3,pg)I[o,M.](5) < E||Y||2®A2®|min(r?,^)+C®A7g min(r?,^)| 

< E||Yf®A2®47/2 (C.2) 

Let N = N[p^,Bp{C)) the minimum number of balls of radius p^ and center in Bp{C), needed to 
cover the set Bp{C). Then, N is at most twice the number of balls of radius p^ needed to cover the set 
Bp{C) for which a bound is given in Lemma 2.5 in van de Geer (2000). Hence, there exist bi,..., b^r. 
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hj e Bp{C) such that t3p{C) C pj. Define Uj{y,z) = g\ 3 .{y,z) + ^'(y,z,bj, and 

ij{y,z) = g]j.{y,z) — ^(y, z, bj,pe). Then, for any b £ Bp{C), there exists 1 < j < such 
that b £ Bp{hj,pt.), so that |pb(y,z) — pbj(y,z)| < 'I'(y,z,bj,p^) which entails that ij{y,z) < 
5 'b(y,z) < Uj{y,z). On the other hand, T(y,z,bj,pe) = Tij(y,z) + T 2 j(y,z) where Tij(y,z) = 
^(y,z,bj,Pe)I[o,M.](l|S"^/^(x -/x)||) and T 2 ,j(y,z) = T(y,z,b^,p,)I(A^^_+oo)(l|S~^/^(x - pt)||). 
Note that using ()C.2p . we get that 

||'hij||L2(P) < 2 (E||Yf X^g = 2 (E||Y||2^)'/' ^ . 

On the other hand, the fact that 'h^(y, z, bj, p^) < {p® + 2C^}^ l|Y|p^ entails that 

||'h2,,||L2(p) < X^pt+2C^}{n\Yf^Is>My^" 

< (E||Y||2^)^/^A*p + 2A"C"{E(||Y||2^l5>AfJ}^^^ (C.3) 

We will apply Holder inequality with p = £js, so that 1/p = s/i and q = ijit — s). Then 

E||Yf*Is>M, < (E||Y||2^?')r (EIs>mJ^ = {e||Y||2^}^ {P(5 > M,)}^ 

so, using that g = e/A we get 

||^2j||l2(p) < (E||Yf")^/^A®^ + 2A"C"{E||Yf^}^{P(5>M,)}^ 

< |(E||Yf")^/^A* + 2A"C'"|E||Yf^}^| 1 

so that 


Uj - ^j\\L^(P) 


< 


2||'I'1J + 'I'2j||p2(p) < 2||Tij||p2(p) + 2||T2j||p2(p) 

|6(E||Yf")^/^ A" + 2A*0"|E||Yf^|^| ^ = e. 


Summarizing, we have shown that the bracketing number Y’j] (e, L^(P)) is smaller or equal than 

N{p^,Bp{C)) which, from Lemma 2.5 in van de Geer (2000), may be bounded as 

N{p,,Bp{C))<2 ■ 
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Note that if pe > C, N{p^,Bp{C)) < 2 x 5^, otherwise N{pf:, Bp{C)) < 2 (hC/p^f. Thus, 


Ny^{e,Qs,L^{P)) < 2 max 




< 2 max 







(C.4) 


Note that in ()C.4h we can always assume that Mg > 1, otherwise we take Mg = 1 which gives 
an upper bound, so that Mg = Thus, if we denote as ^ = plog(5) +plog(5C') + 

plog(5C'^'''^^) + log(2), we have the following bound 


— I du 

u, 


yjlog {N\^-\{u,Qs,L‘^{P)))du < J ^A + plogMu + 2plog 

< VA + VpJ \/logM„ du + J ^^log 


du < oo , 


since by hypothesis /J Vlog M^du = y^log ff^u/AY^u = A f^ ^/log^^^du < oo, where q = 
i/{i — s), concluding the proof. 


Note that if s = 0, the condition fg \/log Hudu < oo suffices to prove that Gs is Donsker. 
Furthermore, if Y, y Zj are independent the assumption fg y/log Ifuidu < oo is also weakened to 
fg \/log Hudu < oo. Indeed, in this case, we define A = 2A^ {3 + C^} and we choose 

Afg as the smallest value greater or equal than 1 such that P(5 > Mg) = P(||I]“^/^(X — /x)|| > 
A/g) < that is, Mg = and using the independence in ()C.3I) . we get 


^2,illL2(P) < (E||Yf")^/^A"4 + 2A"C'*(E||Y||2"P(5>Mg))^/^ 

./I 

< |(E||Yf")^/^A" + 2A*C" (E||Yf")^/^| j, 


so that \\uj —^j||p 2 (p) < e. So, as above, the bracketing number Y[](e, Qg, Lp‘{P)) is smaller or equal 
than N{pg,Bp{C)) getting the bound (IC.4p . Hence, if we denote as H = plog(5) +plog(5C') + 
plog(5C'^+^H) + log(2), we obtain 


J y^log {N[]{u,Gs,L‘^{P)))du < VA + Vp j \/logM„ du + J ^log < oo 

since by hypothesis fg ^/^ognfdu = fg ^f\og H(^u/A)du = A ff^ y/\oglJfdu < oo. 


35 
































n = 200 


CO 


n = 20 


n = 50 


n = 100 


0 . 20 - 

0.15- 

0 . 10 - 

0.05- 

0 . 00 - 



“ 1 —I—I—I—I—r 

0 1020304050 

Tr.cl 



“I-1-1-1-r 

0 25 50 75 100 

Tr.cl 



T, 


n,CL 


0.25-1 
0 . 20 - 
0.15- 
0 . 10 - 
0.05 - ). 

0 50 100150200 

Tr.cl 


0.00 



T, 



n,DS 


T, 



n,DS 


T, 



n,DS 


T, 


n,DS 


0.25- 

0 . 20 - 

0.15- 

0 . 10 - 

0.05- 

0 . 00 - 



“ 1 —I—I—I—I—r 

0 1020304050 

Tn,S 





T, 


n,S 


T, 


n,S 


T, 


n,S 


Figure 1: Density estimates of Tn,cL, Tn,Ds and Tn,s. Black and dotted line corresponds to Hq^\ blue and dash line to green and 

solid line to orange and dot-dashed line to red and long-dashed line to and violet and two-dashed line to 
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Figure 2: Density estimates of ?n,Ds and Tn^s- Black and dotted line corresponds to Hq , blue and dash line to green and 

( 2 ) (2) (1) (2) 
solid line to orange and dot-dashed line to ^ red and long-dashed line to H* and violet and two-dashed line to . 
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Figure 3: Density estimates of ?n,Ds and Tn^s- Black and dotted line corresponds to Hq , blue and dash line to green and 

(3) (3) (1) (2) 

solid line to orange and dot-dashed line to red and long-dashed line to H* and violet and two-dashed line to . 
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Figure 4: Density estimates of ?n,Ds and T^^s- Black and dotted line corresponds to blue and dash line to green and 

solid line to orange and dot-dashed line to red and long-dashed line to and violet and two-dashed line to 
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Figure 5: Density estimates of T^^cl, Tn ,bs and Tn^s- Black and dotted line corresponds to Hq , blue and dash line to green and 

solid line to orange and dot-dashed line to 5 , red and long-dashed line to and violet and two-dashed line to 
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Figure 6 : Density estimates of Tn, cl, Tn,m and Tn^s- Black and dotted line corresponds to Hq , blue and dash line to -^ 10 . 5 , green and 
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Figure 7: Density estimates of 7n,Ds and Tn^s- Black and dotted line corresponds to Hq , blue and dash line to green and 

(7) (7) (1) (2) 

solid line to orange and dot-dashed line to ^ red and long-dashed line to H* and violet and two-dashed line to . 
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